PROCEEDINGS 

OF SCIENCE 



RBRC-778 



Improving chiral property of domain-wall fermions 
by reweighting method 



Tomomi Ishikawa f 

RIKEN BNL Research Center, Brookhaven National Laboratory, Upton, NY 11973, USA 



E-mail: tomomi@quark . phy . bnl . gov 



Yasumichi Aoki 

RIKEN BNL Research Center, Brookhaven National Laboratory, Upton, NY 11973, USA 



E-mail: |yaoki@ quark . phy . bnl . gov 



Taku Izubuchi 

RIKEN BNL Research Center, Brookhaven National Laboratory, Upton, NY 11973, USA 
Physics Department, Brookhaven National Laboratory, Upton, NY 11973, USA 
E-mail: 



izubuchi@ quark . phy . bnl . gov 



for the RBC and UKQCD Collaborations 

The reweighting method is applied to improve the chiral property of domain-wall fermions. One 
way to achieve this is to enlarge L s , the size of fifth dimension, which controls the size of the 
induced chiral symmetry breaking. While this is a type of reweighting method for shifting the 
action parameter, it seems non-trivial since this reweighting means change of the five dimensional 
lattice volume. In this report, we address issues in this direction of reweighting and evaluate its 
effectiveness. 



The XXVII International Symposium on Lattice Field Theory 

July 25-31, 2009 

Peking University, Beijing, China 



* Speaker. 

'Present address: Physics Department, University of Connecticut, Storrs, CT 06269-3046, USA 

© Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike Licence. http://pos.sissa.it/ 



Improving chiral property of domain-wall fe unions by reweighting method 



Tomomi Ishikawa 



1. Introduction 

The domain- wall (DW) fermion is one realization of the chiral fermion on the lattice [p], ®- 
The DW fermion is constructed by five-dimensional formulation and has a parameter L s , which 
is the size of fifth-dimension. The chiral symmetry is realized in the L s — > oo limit. In realistic 
simulations, however, we have to take L s finite; then the symmetry is slightly broken. The broken 
chiral symmetry causes the additive mass shift m res , which is called "residual mass" and captures 
the degree of the breaking. In recent lattice QCD simulations, we are going into the regime in 
which the quark mass is lighter and lighter and the volume is larger and larger. The computational 
resources required are becoming ever larger in these situations, and then it is hard to take L s so 
large that the chiral symmetry breaking is negligible. Improving the chiral property of the DW 
fermion is a natural thought in this circumstance. One possible strategy for this is introducing 
a twisted mass term to improve chiral symmetry, which suppresses a topology change [||, |J. 
Recently, reweighting techniques are becoming widely used in QCD simulations. Among them, 
the reweighting for the quark mass parameter is intensively applied and seems effective [Q]. Our 
aim of this work is to enlarge L s by reweighting to improve the chiral property. In this report, 
we discuss some techniques toward the reweighting and its effectiveness. We also perform some 
simple tests as an experiment using Nf = 2 + \ dynamical DW fermion configurations with volume 
L 3 xrxL. v = 16 3 x 32 x 8: 

Conf [A] : j8 = 2.30 (Iwasaki gluon), m ud = 0.04, m s = 0.04, M 5 = 1.8, (1.1) 
Conf [B] : j8 = 2.13 (Iwasaki gluon), m ud = 0.02, m s = 0.04, M 5 = 1.8, (1.2) 

produced by RBC and UKQCD Collaborations [§]. 

2. Reweighting factor 

We consider to reweight configurations with L s 
the strange quark sector even though we use Nf - 
assume the effect is small and the present work is at 
can be simply written by 
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where D,(m^) represents DW Dirac matrix with parameter set (L s =L i -,my), where rrif represents 
the bare DW fermion mass. One of the interesting features of this reweighting is the existence of 
the part with nif ■ = 1 which is coming from Pauli-Villars (PV) field of the DW formalism. While the 
systems before and after the reweighting have different five-dimensional volume, the effect coming 
from the PV sector cancels out a large portion of the volume factor. 

3. Stochastic estimation of the reweighting factor 

In order to estimate the reweighting factor we use the stochastic estimator with random Gaus- 
sian noise. When we consider the determinant of a matrix O, its stochastic estimation can be 



= L\ to Lj (L\ < Li). In this work, we ignore 
= 2 + 1 dynamical configurations because we 
an experimental stage. The reweighting factor 
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expressed by 

- =^^ i = <^'-*> { =<0„ (3-D 

where (■■■)? denotes the ensemble average over complex random Gaussian noise vector E, . A naive 
estimation, however, might end in failure. In this section, we explain several implementations used 
in the reweighting. 

3.1 Canceling the fluctuations 



A naive way to estimate the reweighting factor w ( |2.1| ) is to calculate each determinant sep- 
arately by using Eq. (fO|). This way, however, is not efficient, because each determinant can have 
large fluctuations. To reduce the large fluctuation, a determinant of the whole product of Dirac 
operators is estimated using one Gaussian noise E,. For the efficient sampling of e~ H , we choose 
hermitian operator £1 = <p^<p in Eq. (3.1) and thus the reweighting factor becomes w = det [<p^<j>] . 



In this work, we take an operator for as 



where a notation, £F = VD { D, is used, and the square root is implemented by the rational approx- 
imation [Q]. While we can, of course, write down the (j) without using the square root, we use it 



on puipose for later convenience (See Sec. 3.2). At the mathematical level, different order of the 



matrices in Eq. (3.2) provides exactly the same value of w. In the stochastic evaluation with finite 



statistics, however, they could give different value and we need to investigate the optimal choice. 

3.2 Breaking up determinants 

While the statistical average of e~ H always converges the correct determinant det £2 in the 
infinite number of sampling, the estimation deviates from the true value significantly for finite 
statistics. Moreover it is difficult to estimate the size of the error as we will see in the next subsec- 
tion. This is due to the long tail of the asymmetric distribution of e~ H , and small number of outliers 
dominate the average. To avoid this obstacle, it is found to be efficient to break up the determinant 
into many number of smaller pieces ^ so that the effect from the outliers is suppressed. One way 
for the breaking up is splitting the parameters that we want to shift in the reweighting, for example, 
splitting the mass parameter in the mass reweighting. In our reweighting, this splitting can be done 
by dividing Li — L\. 

w = det£l = deta Ll ^ h ■ detQ fe _>/ 3 • • • detn,„_,^/„ ■ detO/^^, (3.3) 

where L\ < 1% < ■ ■ ■ < l n < Li- Another possible way is to use the so-called n th root trick. It is 
provided by a simple mathematical identity: 

w = detD. = (detD. l/n Y . (3.4) 



This splitting can be easily performed by using the rational approximation for the Dirac matrices, 
and we implement it in our simulation code as we explained before. In this breakup, the magnitude 
of the fluctuations for each divided estimator is roughly l/n times smaller than original. Of course, 
a hybrid method combining these two ways of breaking-up the determinants is possible. 
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3.3 Numerical demonstration of the techniques in the reweighting 

Here we show some demonstrations for the n th root trick and combining the determinants. The 
reweighting shown here is that from L s = 8 to 10, and we used one configuration (traj id = 1000 in 
Conf [A] (0)). 

n ,h root trick 

We discuss the effectiveness of breaking up the determinant using the n th root trick. We con- 
sider the reweighting factor w (2A), and we take the n th root of it: 



- = (det foj*,])" = H(e-^n^-m % = (3.5) 



i=\ i=l 

with 



<j) n = {/@ 2 (m f ) - 1 J/^ri)— i = . (3.6) 



Fig. |l] shows the history of h in Eq. (3.5). h e s in Fig. [j] is defined by e hat = (e and represented 



by red lines. When n=\, that is, we do not impose the n th root trick, the fluctuation of h is so huge 
(~ O(10)) that the sampling is dominated by only one hit and tends to fail. We find, however, that 
the magnitude of the fluctuation of h behaves as ~ \jn as n increases, and the sampling gets close 
to the situation in which (h)? ~ /i e ff ((h) ^ is represented by blue lines). Fig. ||] shows the total // e g 
defined by e~ Hel1 = (e~ H )^. We find that the case n = 1 (no root) gives a quite misleading value, 
and as n increases, Hgg approaches an asymptotic value. Note also the jackknife errors completely 
underestimate the true error for small n. The 16 th root seems sufficient to obtain the correct value 
of the reweighting factor for this case. 
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Canceling the fluctuations 



Here we discuss cancellation of the fluctuation. As a test, we consider the 4 th root of the 
reweighting factor, and we use two kinds of stochastic estimation: 
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The history of is shown in Fig. |l| (n = 4) and of and /zl 2 are shown in Fig. ||. While 
magnitude of the fluctuation of the h\ Jl and h^ 2 is around 200, that of the /iliL 2 i s largely reduced 
to around 10. It shows combining the determinants gives us a great advantage. 



4. Fluctuation of the reweighting factor among different gauge configurations 

In this section we discuss the fluctuation of the reweighting factor w among different gauge 
configurations. Even when the correct w for a given configuration is obtained by the methods in 
previous section, too large fluctuation of w among different gauge configurations could still ruin 
the precise estimation for the final reweighed observable. As an example, we show results of the 



reweighting from L s = 8 to 16 using configurations of traj id = 1600 ~ 3500 in Conf [B] (1.2). 



4.1 Naive fluctuations 

Fig. ^| (a) shows the obtained // e ff of each configurations. The blue line in the figure repre- 
sents a constant fit line and its # 2 /d.o.f. is also put. While the // e ff's themselves seem to be well 
determined, their fluctuation among different configurations is quite large. Then we conclude that 
the overlap between original and desired configurations is so small that it is difficult to perform the 
reweighting reliably for our parameter and volume. 

4.2 Shifting parameters 

Now we consider to compensate the reweighting factor for the large fluctuation with shifting 
simulation parameters. Here we consider a gluon action with standard combination: 

Sgiuon = — j3 (cq [plaquette] + c\ [rectangle] + C2 [chair] + C3 [parallelogram] ) , (4. 1) 
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Jl/d.o.f. = 16.1 



AP = -0.0872 
Ac, = -0.0200 



2000 2500 3000 3500 
traj 



traj 



1500 2000 2500 3000 3500 
traj 




AP = -0.0914 
Ac, = -0.0196 
Ac 2 = -0.000460 
Ac 3 = -0.00134 

I I L 



1500 2000 2500 3000 3500 
traj 



(a) without shift (b) with shift of j8 (c) with shift of j8 , (d) with shift of j8, 



Figure 4: i/ e ff in configuration to configuration. 



c\,C2 and C3 





0.592 0.5925 
plaquette 



(a) plaquette 



0.349 0.3495 0.35 0.3505 
rectangle 



(b) rectangle 




0.4115 0.412 0.4125 
chair 




(c) chair 



5.4855 0.486 0.4865 0.487 0.4875 
parallelogram 



(d) parallelogram 



Figure 5: Correlation between 7/ e ff and various link loops. 



with co + 8c i + I6C2 + 8C3 = 1 (c\ = —0.331 and C2 = C3 = on the original gluon configura- 
tion (Iwasaki glue)), which means that we have a parameter space (L s ,P,ci,C2,C3,mf,Ms) in our 
simulation where M5 denotes DW height. We, however, consider shifting parameters only in the 
gluon sector /3, ci, C2 and C3 in the reweighting here. Fig. || represents the correlation of // e g 
with plaquette (a), rectangle (b), chair (c) and parallelogram (d). Each quantity is likely to have 
a linear correlation with H e s (the guide is shown by blue lines in each figures), which means a 
shift of p could help to reduce the fluctuation. And although the trends of the figures seem to be 
the same, they are still slightly different, which means we could use their combination to reduce 
the fluctuation. Figs. || (b), (c) and (d) show the // e ff changed by optimised parameter shift which 
gives the smallest # 2 /d.o.f. in the constant fit. If we keep the value of the Iwasaki gluon parameter, 
and we only shift /3, the reduction of # 2 /d.o.f. is small. (Fig. || (b)) On the other hand, we can 
largely reduce # 2 /d.o.f. by shifting c\. (Fig. ^ (c)) The shift of C2 and C3 does not contribute to the 
reduction so much. (Fig. || (d)) While the shifting the parameters in the gluon sector can contribute 
to the reduction of the fluctuation, the fluctuations are so large that we are unable to perform the 
reweighting. This indicates that we need to shift other parameters, that is, my and M 5 . 
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4.3 Balanced reweighting 

Besides shifting the parameters, an alternative idea is to balance more than one type of reweight- 
ing. While we investigate Nf = 2 + 1 simulations, the strange quark sector has not been accounted 
for in this study. When the strange quark is included, we obtain an additional handle to control the 
fluctuations. If we consider chiral symmetry for the strange quark is less important than that for the 
up and down quarks, the fluctuation could be suppressed by enlarging the L s for the up and down 
quarks while reducing L s for the strange, assuming large cancellation of the fluctuation between 
the light and strange quark sector. The up and down sector and the strange sector are, as it were, 
well "balanced". One of the other application of this concept might be the mass reweighting. For 
example, in the Nf = 2 + \ simulation one could reduce the fluctuations between the reweighting 
due to the shift of degenerate up and down sea quark mass m„j, and that of strange sea quark mass 
m s to the first order in the mass shift Am: 

w{m u d m u d — Am; m s — > m s + 2aAm) = 0(Am 2 ), (4.2) 

where a is one at m u d = m s and is decreasing function of (m s — m u d), to be tuned by numerical 
calculations. 

5. Summary 

In this report, we discussed the possibility of the reweighting method to enlarge L s . The 
reweighting factor itself can be calculated correctly by using the stochastic method, dividing the 
determinant into many pieces. In this study we used the n th root trick to control this, and we found 
that it works well. The problem lies on the fluctuation of the reweighting factor among different 
gauge configurations, which is quite large. Although we tried to make parameter shifts in the gluon 
sector to cure the situation, it seems not enough. We consider that additional parameter shifts, like 
nif and M$, are needed to suppress the fluctuation. We are going to address this issue in the future. 

Acknowledgments 

We acknowledge RIKEN, BNL and the U.S. DOE for providing the facilities on which this 
work was performed. We also thank Chulwoo Jung for helpful discussions. This work is supported 
in part by JSPS KAKENHI 21540289. 

References 

[1] D. B. Kaplan, Phys. Lett. B 288, 342 (1992) [arXiv:hep-lat/9206013]. 
[2] Y. Shamir, Nucl. Phys. B 406, 90 (1993) [arXiv:hep-lat/9303005]. 

[3] D. Renfrew, T. Blum, N. Christ, R. Mawhinney and P. Vranas, arXiv:0902.2587 [hep-lat]. 

[4] H. Fukaya, S. Hashimoto, K. I. Ishikawa, T. Kaneko, H. Matsufuru, T. Onogi and N. Yamada [JLQCD 

Collaboration], Phys. Rev. D 74, 094505 (2006) [arXiv:hep-lat/0607020]. 
[5] A. Hasenfratz, R. Hoffmann and S. Schaefer, Phys. Rev. D 78, 014515 (2008) [arXiv:0805.2369 

[hep-lat]]. 

[6] D. J. Antonio et al. [RBC Collaboration and UKQCD Collaboration], Phys. Rev. D 77, 014509 (2008) 

[arXiv:0705.2340 [hep-lat]]. 
[7] M. A. Clark and A. D. Kennedy, Phys. Rev. Lett. 98, 051601 (2007) [arXiv:hep-lat/0608015]. 



7 



